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CN ■ Abstract 

A simple model of a frustrated disordered system is presented. Apart 
from the (very different) physical interpretation, the model shares many 
(NJ ' features with that of Sherrington-Kirkpatrick for spin glasses, but, as a 

0^ , consequence of its relative simplicity, its ground state can be exactly de- 

termined by numerical methods. This fact allows us to test experimen- 
(^ ■ tally some theoretical predictions, based on a specialization of the "cav- 

^" , ity method" developed for the SK model, which is presently limited to 

a "non-frustrated" approximation, corresponding to some extent to the 
j^ ■ replica-symmetric one for the SK model. 
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1. Introduction 

The Sherrington-Kirkpatrick model for spin glasses ([1], SK model in the following) was 
proposed in 1975 as a simple and exactly solvable model of a disordered frustrated system. 
This is a model with very long range interaction, for which the mean field theory is supposed 
to hold exactly, and therefore it should be easy to handle. But the hopes of the authors 
were soon disappointed: the model is exactly solvable, but its solution turned to be much 
more complex than what initially thought. Only in 1980 the work of Parisi [2] and the 
complicated and mysterious "breaking of the replica symmetry" solved the problem, and 
only a few years later the so called cavity method [3,4,5,6] revealed the physics hidden 
behind the intricate mechanism of the replica formalism. 

Two features appear to be essential in the model, both linked to a highly non-trivial 
decomposition of the Boltzmann measure in pure states: 

1) the pure states are organized in an ultrametric scheme [3,4]; 

2) the free energies at each level of this organization are Poissonian random variables with 
exponential density, and the levels of organization are connected through a probability 
cascade a la Ruelle [5,7]. 

It is not yet clear whether this picture can be applied to other frustrated disordered 
models, such as 3-dimensional spin glasses with nearest neighbours interaction [8,9].*^*-' For 
this reason, we have considered a model which preserves many features of the SK model, 
but is much simpler to be analyzed numerically. Our hope is twofold: to clarify if and 
how one can apply the same "ultrametric" picture as the SK model, and to study the 
nearest neighbours interaction case, taking advantage of a better numerical tractability. 
This letter represents a first step in this direction. 

The outline of the paper is as follows. In section 2 we define the model, in section 3 
we clarify the concept of frustration in the context of our model, in section 4 we propose 
two approximate analytical solutions of the model, in section 5 we show some numerical 
results, as a test for the analytical ones. 



2. Definition of the model 

We consider a set of points V, connected through links of a given length, defining the 
distance between each pair of points of V. We start by stating the problem with the 
greatest generality, with the points of V not necessarily distributed in the real space, and 
their relative distances not necessarily being the euclidean ones. To be precise, let us recall 
a few definitions. A weighted graph is a triple G = {V, A, d) where V^ is a finite set of nodes 
or vertices, A G V x V is the set of the arcs (i.e. the links) which connect the nodes of V 
and the map d : A -^ R"*" gives the length (or weight) of an arc, i.e. the distance between 



^*> Models like the REM and the GREM of Derrida [10,11], though very instructive, are 
rather artificial. 



the nodes of an (ordered) pair. A path of n steps in the graph starting from u and leading 
to f is a sequence of n + 1 nodes {wi}^=o such that 

uo = u , Un = v, Ui^Uj ^ ij = 0,...,n i ^ j 
{ui, Ui+i) e A Vi = 0, ...,n-l 

Let now Cu,v be the set of the paths joining u to v, irrespective of the number of their 
steps. We suppose that the graph is connected, i.e. that there is at least one path joining 
u to V. The length of paths in Cu,v is represented by a map / defined on Cu,v in a natural 
way, i.e. / : C^^y — > R""" such that 

n-l 

c e Cu,v , c = {Wi}7=o ' l[c\ = ^ d{ui, Ui+i) (2) 

i=0 

Our system has phase space 

^ = Cs,t 

with s and t fixed in advance, and Hamiltonian 

The ground state of the system is the shortest path in G between s and t. Using a 
standard algorithm, the Dijkstra's algorithm (see for instance [12] and references therein), 
it can be determined in a time which grows not too fast with the size of the problem, for 
any realization of the distance matrix d{u, v). On the contrary, to find the ground state of 
the SK model is a so called A^P-complete problem (see [12] and also [13]) and therefore it 
is very unlike that a "good" algorithm can be found for it. 

The distances d{u^v) play the same role of the couplings Jij among the spins in a 
generic model of interacting Ising spins. In the case of our interest they will be random 
variables and will represent the "disorder" of the model. How frustration arises in this 
context will be discussed in the next section. As anticipated, our main interest in studying 
the problem is to gain informations about the behaviour of disordered and frustrated 
systems, in a context which has a rather rich structure, but nevertheless is simpler to 
handle than spin systems. Apart from this rather abstract motivation, the study of our 
problem could be useful in the framework of high-temperature expansion for disordered 
systems of interacting Ising spins. Moreover, also the problem of directed polymers in 
a random medium (see, for some recent results, the work in [14] and references therein) 
could be stated as a particular case of our problem. In fact, a directed polymer is a path 
in a particular graph which has a privileged direction of motion: that is to say, with the 
previously introduced notation, that if the link (-u, v) E A points in the privileged direction 
then {v,u) ^ A (the reversed link is absent). With our notation the random medium is 
represented by a distance matrix written in terms of a single random function (/>, the local 
disorder, 

(l){u) + (pjv) 
a{u, V) = 



so that equation (2) for the length of a path c of n steps becomes 

(/){uo) + (/){Un) 



n 



1=0 

which is the usual potential energy term in the hamiltonian of a polymer in a local disorder 
field, apart from a contribute vanishing in the limit n ^ 1. 

The model we shall consider in the following is defined by: 

1) (w, v) E A for every u,v E V, with u j^ v; 

2) the (i('U, t')'s are independent identically distributed random variables (with d{ujv) = 
d{vju)). 

Statements 1) and 2) are the counterparts of the defining characterization of the SK model, 
the ones by which mean field theory turns out to hold exactly. This analogy makes us 
confident that one could find an exact solution for our model, although the meaning of 
"mean field theory" in our context is not clear. Moreover, the fact that from a numerical 
point of view the problem is much simpler to deal with than the SK model, lead us to hope 
that the same happens also in an analytical approach. We did not succeed in finding such 
an exact solution, nevertheless thanks to the 1) and 2) above we can obtain an approximate 
solution (see section 4) which fits well enough the numerical data (see section 5). 
We choose as probability distribution for the d 

p^{x) = {a + 1) ■ x"" , a:G[0,l] (3) 

In the analytical computation of section 4, a remains a free parameter, whereas in the 
numerical simulation of section 5 we fix, just for concreteness, a = and a = 1, i.e. we 
consider, respectively, uniform and linear probability distribution of the distances. 



3. Frustration 

It is not immediate to understand how the usual concept of frustration, developed in 
the framework of spin systems, applies to this context. What naturally characterizes 
frustration, in a context-free way, is the existence of a lot of local minima, hampering the 
selection of the global one. Although in our model we can not precisely identify the local 
minima, we propose a generic definition of frustration, which easily implies that our model 
is frustrated. 

It is now useful to introduce another model, structurally similar to ours. Let us recall 
that a tree is a connected graph without cycles and a spanning tree of a graph is a tree 
which has the same set of nodes as the graph. The phase space of this second model is the 
set ST of all the spanning trees of G, and the Hamiltonian is the map weight w : ST -^ R"'" 
naturally defined by 

TeST.T^iV.Ar), w[T] ^ Yl ^(^'^) (4) 

{u,v)eAT 



In the following we shall call minimal spanning tree (MST) the ground state of this system. 
As a matter of fact, the Dijkstra's algorithm, which determines the shortest path from s 
to t, gives also the shortest path from s to any other u E V. These paths form a tree, the 
so called shortest path tree (SPT). Thus, to a certain extent, the SPT problem and the 
MST problem are similar, as their solutions are both represented by trees on the graph 
(they do not coincide, however, as it is easily shown by a counterexample). On the other 
hand the algorithms which solve them are rather different. The Dijkstra's algorithm is 
somewhat tricky. On the contrary, the algorithm which solves the MST problem is much 
simpler: 

MST := ; A' := A ; (5) 

repeat 

let a the arc in A' of minimal length ; 

A' := A' \ {a} ; 

if MST + a is a tree then MST := MST + a ; 

until A' = ; 

This algorithm is called greedy for obvious reasons. 

Systems for which the greedy algorithm to find the ground state works are called 
matroids. Though this seems to be an informal characterization, it turns to be equivalent 
to a purely algebraic one (see [12]). For matroids there exists a natural way to give a notion 
of neighbourhood between the points in the phase space, that for the spanning trees of a 
graph results in: 

If T E ST then the neighbours of T are all the T' G ST (6) 

obtained from T as follows: add an arc to the tree T , 
producing a cycle; then delete one other arc on the cycle. 

If we consider the local minima of the Hamiltonian w with respect to this "topology" , it 
is an amusing exercise [12, exercise 3 chap. 1] to show that, apart from accidental degen- 
erations, there exists a unique local minimum, which therefore is a global one. Thus the 
following steepest descent algorithm is equivalent to the greedy one, in order to determine 
the MST: 

let T EST ; (7) 

repeat 

let T' the neighbour of T of minimal weight ; 
if T' ^ T then T -.^ T' ; 

until T' = T ; 

MST := T ; 

Now, it seems natural to consider the matroids as the archetypical example of a non- 
frustrated system, in contrast with the frustrated ones, where, as it is well known, the 
huge number of local minima causes the steepest descent method to fail. We remark that, 
to adapt the greedy algorithm to a spin system, it is necessary to orient subsequently the 
spins of the system, so that the couplings with the already oriented ones are satisfied. It 
is not difficult to make the above procedure formal, and to verify that it works if and only 
if for every n-tuple of lattice sites {zi, . . . , z„} the following condition holds: 



where Jij is the couphng constant between the spin ai and the spin aj. This is nothing but 
the usual definition of absence of frustration. In conclusion, two facts lead us to conjecture 
that matroids are the most general non-frustrated systems: 

- there exists a unique local minimum, which therefore is a global one; 

- the spin systems which are matroids are the non-frustrated ones, in the usual meaning. 
From this point of view our model is frustrated, and the MST problem becomes its non- 
frustrated counterpart. To prove that our system is frustrated it is sufficient to show that it 
is not a matroid, and this can be done in two ways: either by its algebraic characterization, 
or by showing with a counterexample that to be greedy does not work. Roughly speaking, 
in order to find the shortest path between s and t, it is not sufficient in general to proceed 
naively, i.e. by choosing always the shortest arc when going out from a node. 

In the following, we shall fix our attention on the shortest path between two fixed 
nodes in a graph, which we denote with s and t. The size of the graph, i.e. the number of 
its nodes, will be denoted by A^, and we shall be interested in the asymptotic properties in 
the limit A^ -^ oo. Moreover, as it will be clear in the following, we shall study, together 
with the shortest path, also the paths in the graph between s and t which are next to the 
shortest one, i.e. we order the paths by increasing length, and we shall be interested not 
only in the first one, but also in the second one, the third one, and so on. These will be 
called "subminimal" or "suboptimal" paths, whereas the shortest path is the minimal, i.e. 
the optimal, one. We do not attempt to give a rigorous definition of this notion, i.e. we 
do not say how close a path should be to the shortest one, to be called a subminimal one. 
However, if we order by increasing length all the paths between s and t in the graph, it 
is clear that the /i-th one, with h fixed, is always subminimal in the limit N ^ oo, since 
it becomes infinitely close to the shortest one. In fact, as far as we could say about our 
problem, by a partial analytical solution and a numerical one, all the shortest k paths in 
the graph, with k fixed, in the limit A^ ^ 1 share the same leading behaviour versus N 
(see (12) below, and section 5). Thus we could take as an informal characterization of the 
subminimal paths the fact that they all have the same asymptotic behaviour versus N, 
when A^ goes to infinity. It is clear that the number of subminimal paths must grow with 
A^, but we do not know at all how. 

We believe that this characterization, though informal, is not trivial. Roughly speak- 
ing, we do not know exactly which paths are the subminimal ones, but we can argue which 
of them are surely not. In fact, let us suppose we are looking for the shortest path be- 
tween s and t which visits also any other node in the graph. Such a path is the so called 
Hamilton path; to find it is a problem very close to the well known travelling salesman 
problem (TSP), which is a so called NP-complete problem (see [12]), i.e. so hard to solve 
as to find the ground state of the SK model. On the other hand, our problem, i.e. the 
problem of finding the shortest path between two nodes in a graph, is one of the so called 
class-P problems, and it is much easier to solve numerically. It is commonly believed (see 
[12]) that A^P-complete problems are drastically different from class-P ones, at least as 
far as the cardinality of the continuum (i.e. the number of points on the real line), is 
drastically different from the cardinality of natural numbers. Thus we argue that the class 
of subminimal paths, i.e. the paths close to the shortest one, is drastically different from 
the class of the paths close to the Hamilton path. 
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4. Two partial solutions 

The following heuristic argument gives some insight on the ground state of our model; but, 
in fact, it is easy to see that the argument should work only in the non-frustrated case, 
i.e. in the MST problem. Nevertheless we will see later that the resulting relations fit the 
numerical data well enough (see (8) and next section). We try now to estimate the length 
of the shortest path between s and t. Starting from s, we select the outgoing arcs shorter 
than e (to be determined), and so on, until we reach t. At each step the average number 
of selected outgoing arcs is 

N^ = N / Pa{x) dx 



where A^ = \V\ is the number of nodes in G. After n steps, the number of reached nodes 
is about A" (if A^ ^ 1). The probability of reaching t in n steps is one if N^ = A, i.e. 



In A 

n 



In A, 
The length of the path thus constructed is approximately 



I — n 

Taking the minimum as e varies, we get 

In A 

n 



J^Pc,{x)dx 



a + 1 
K 



(8) 



n 



(K = -^iie 



We have performed a numerical analysis, measuring the average length of the shortest 
path, and the average number of its arcs, for various values of A and both uniform and 
linear probability distribution of the distances, i.e. for a = and a = 1. The result of the 
analysis is shown in next section: for the time being we stress that the numerical data can 
be fitted with the following functional form, for A ^ 1 (see figures 1 and 2) 

n^ c-lwN -\- C 

I K (8') 

r-~j 

This is in close agreement with (8), and, fairly surprisingly, also the numerical values of 
c and s agree with the ones in (8), i.e. c = s ~ l/(a + 1). This fact will be discussed in 
greater detail in the following. 

This approximate computation can be replaced with a more precise analysis as follows. 
Consider the set Cs,t of all the paths from s to t, ordered by increasing length, and denote 
with P'^^\n, {xi}'^^i) the probability distribution that the /i-th path is composed of n arcs 



of lengths Xi, . . . ,x„. We shall need the following notations: let ak be the length of the 
arc k E A; given a path c G Cg^t^ let Ic be its length, tie the number of its arcs, and 
{fcc}?=i <^ ^ the sequence of arcs which it is made of (so that Ic = X]r=i '^fcO; finally we 
need the characteristic function 

^' = E n e(/,,-g-e(/,-/,,_j---e(/e,-/cJ (9) 

Cl,---,Ch-l c'^Cl,...,Ch-l,C 

(which is 1 if c is the h-th path, elsewhere). Then it is immediate to see that: 

/n 

In order to make the computation of the integral in (10) feasible, we shall make the 
additional assumption 

ric 

keA ceCs,t *=i 

i.e. two different paths do not have common arcs. Assumption (11) should be given 
a suitable sense "in the average" for A^ ^ 1 (since, as it stands, it is evidently false). 
However this is a minor problem, firstly because we could not find better simplifying 
assumptions; secondly, because we believe that to neglect the correlations between the 
paths (as in (11)) corresponds to a kind of non-frustrated approximation, which do not 
suitably fits our problem. We hope that this point will become clearer in the following. 
We omit here the cumbersome algebra, and give only the final results: for A^ ^ 1 with h 
fixed(*) 



and 



, .(u\ \nN 1 r, , ,x , v^ li ^/InA^x 

<">"" = ^TTT + J^ ['"<" + ^' + ^ - ^^ + 5: -]+ o(—) 

n=l 

{n)W ;^Ari/(a+i) l^ (a+l)(n)WJ ^ 1« + -lj- 
^n -O(^) ^ = o( ^ ) 

\^r. at) ' //\2 ^^l„2 T^rJ 



(12) 



(n)2 ^InN' ' (/)2 ^In^N' 



where (n) and (/) denote the mean value, respectively, of ric and Ic in the distribution (10). 
The relations (12) give to leading order the non-frustrated behaviour of (8). In other 
words, this more refined computation with the simplifying assumption (11) gives again, as 
far as the leading behaviour is concerned, the result of the previous heuristic argument, 
i.e. a functional form of the type (8') with c = s = l/(a + 1), which, as we have already 
pointed out, fits well enough the numerical data (see next section and figures 1 and 2). 
An interesting feature of the form (12) is that the leading behaviour versus N, N ^ 1, 
of the number of arcs and of the length of the h-th path, is not dependent on h: this is 



*■*'* IE = 0.5772157. . . is the Euler's gamma constant. 



evident from the fact that the dependence on h in the right hand side of the (12) is only in 
the next-to-leading terms. This feature is very well confirmed by the numerical data (see 
again next section and figures 1 and 2), so that, as we anticipated at the end of previous 
section, we feel authorized to study all the subminimal paths as a whole. As far as the 
complete probability distribution of Ic is concerned, it results: 

,m (13) 

^-(0 = T7^-^ ^f={a + l)!V(-+i) Ari/(«+i) 
N[a + 1) 



The plots of the distributions P*^^''(/), for A^ = 100 and /i = 1, 2 and 3, are shown in 
figure 3, whereas in figure 5 a comparison with the experimental results is exhibited. One 
can see that the qualitative shapes of the distributions (13) and the experimental ones 
are rather different, though their mean values are very close. For this reason we believe 
that the assumption (11), that we called "non-frustrated" and that was at the basis of this 
computation, is not completely satisfactory to fit our problem. 

Another approach to the problem is suggested by the cavity method of Mezard, Parisi 
and Virasoro [6] . We can not pursue this approach beyond a kind of non- frustrated ap- 
proximation, to some extent equivalent to the (11) above, and very reminiscent of the 
replica-symmetric approximation for the SK model. We shall describe briefiy the proce- 
dure. Let Gn = iV]\[,A]\[) be a graph with A^ nodes, and fixed s G Gn let {lu}ueVN\{s} 
be the lengths of the shortest paths in Gn from s to u. Let us denote with Vn{{Iu}) their 
joint probability distribution. Let now Gn+i = {Vn+i^An+i) be the graph with A^ + 1 
nodes obtained by adding a node uq to Vat, and all the arcs between uq and the nodes 
of Vn to An; finally, let {Cu}ueVN be the lengths of these new arcs, with distribution 
P{{Cu}) = YlPoti'^u)- The shortest path from s to uq in Gn+i has length 

/o = min[£s, min {l^ + £„)] (14) 

^^eViv\{s} 

As above, it is straightforward to write the exact form of the probability distribution of 
/o, but there is no point in using it without suitable simplifying assumptions. Namely, we 
shall assume the {lu} to be uncorrelated, that is to say 

vn{{Iu})= n ^^(^-) (15) 

uEVnMs} 

At first glance it is not clear which is the relation between this approximation and the (11) 
above: the latter consists in neglecting the correlations among the paths between the same 
pair of nodes, the first one ignores the correlations between the shortest paths connecting 
a fixed starting node with all the other ones of the graph. However, as we shall see, the 
resulting behaviour for the mean length of the shortest path is exactly the same, though 
the two complete probability distributions differ qualitatively. Now, using the (15) one can 

9 



translate the recursive relation (14) in a recursive relation for the probability distribution 
of the shortest path: writing 



F^(/) = l- / PN{t)dt 
Jo 



we obtain, in a more or less straightforward way, 



Fn+i{1) = [l - / Pc] ■ex.p-N[ Pa- I Pc{l-x)FN{x)dx) 
Jo Jo Jo 



(16) 



If we now assume, apart from negligible terms, 

F^{1)^Fn+i{1)^Fn{1) foriV>l, (17) 

(16) becomes an integral (non-linear) equation in F^. It is possible to justify (17) a 
posteriori. 

If a = the equation reduces to 



F~(/) = (1 - /) ■ exp-A^(/ - f F^{x) dx) 

Jo 



which is equivalent to 



d F°° 

F^(0) = 1 
and can be linearized by putting T = 1/F. At the end we get 

^"W - , , expliVQ - 1 - • ^^(') = -S^«-(0 (19) 

^+ A^(l-/) 

If a 7^ the equation (16), again under the hypothesis (17), must be solved numerically, 
thus giving F^{1) for each fixed value of A^ and a. The resulting distributions -P^(/) differs 

qualitatively from the P^ (0'^ of equation (13) above, even for /i = 1. Nevertheless, as we 
shall see soon, their mean values coincide. A comparison between these two distributions 
and the experimental one is shown in figure 5, for A^ = 100 and both uniform and linear 
probability distribution of the distances, i.e. for a = and a = 1. 

As far as the mean value of / is concerned, if a = the (19) can be easily integrated 
and gives 

f InN 

(/) = y/p^(/)d/^^^ (20) 

10 



in perfect agreement with what is predicted by equation (12). If a 7^ we must solve 
numericaUy the integral equation (16) for some selected values of A^. Then we compute 
(/) and we fit the resulting data with a functional form of the type 

(/) = K' —- h subleading terms 

We have performed such an analysis for a = and a = 1, computing (/) for the same values 
of N of the experimental data, thus giving to the finite-size effects the same relevance. The 
details are given in next section. For a = we obtain 

K' = 1 s = l 

with negligible corrections, in perfect agreement with (20), thus showing that the selected 
values of A^ are not too small. For a = 1, fairly surprisingly, we have found again a very 
good agreement with what is predicted by equation (12), i.e. 



K' = V2/2 s = 1/2 

Thus this approach (suggested by the cavity method of [6] with the assumption (15)), as 
to the leading behaviour of the length of the shortest path, is equivalent at all to the one 
which leads to equation (12), i.e. the direct one with the assumption (11). 

As anticipated, in next section we shall see that the non-frustrated behaviour of equa- 
tion (12), shared by both our analytical approaches, fits well enough the experimental 
data for (/) and (n). As we shall see, however, the agreement is not beyond any doubts, 
so we believe that it would be essential to improve the rough approximation (15), thus 
obtaining a more refined prediction than equation (12). An idea for a future work could 
be the following. 

Let us assume that the Dijkstra's algorithm has given the SPT for the graph Gn with 
N nodes; then we dispose of the exact numerical values of the lengths lu of the shortest 
paths from s to any other node u G Vn. If now we add the node -uq, obtaining Gat+i, 
the length /q of the shortest path from s to uq is given by (14). What can be said now 
about the other lengths lu of the shortest paths from s to the nodes of Vat? Of course, 
they will be affected by the addition of uq, in analogy to what happens in the SK model 
when applying the cavity method. In fact the addition of one spin to an A^-spin system 
produces a rearrangement of the relative weights of the configurations in a pure state, 
thence a minor rearrangement of the relative weights of the pure states in a cluster and 
so on. The cavity method allows to keep track of this cascade of rearrangements (see [6] 
for details) . The crucial observation was the discovery of the cluster structure of the pure 
states in the SK model, the so called ultrametric organization (see [4]). We do not yet know 
how to implement an analogous procedure in our model, though it seems to be natural 
that an improvement of the rough approximations (15) or (11) should be required. In 
some sense the approximations we used here correspond to a "replica-symmetric" solution, 
which does not take into account the rearrangement of the pure states. This would be 
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exact if the system was not frustrated (i.e. with only one pure state, apart from accidental 
degenerations). 

The first condition, in order to start the program sketched above, is to find out what 
plays the role of the SK pure states in our model. Let us therefore consider the following 
argument. After adding the node uq to the graph Gn obtaining Gn+i, one can apply 
the Dijkstra's algorithm to find the SPT in Gn+i- Take now a node u in Vn- If the 
shortest path in Gn+i from s to u does not pass trough uq, then it will coincide with the 
old shortest path in Gn- If, on the contrary, it passes through uq, it is interesting to see 
how much "close" (in a suitable sense) it is to the old shortest path in Gn- In general, 
there is no reason to expect these two paths to be close, because of the frustration of the 
model, but one can reasonably suppose that the new path runs close to an old subminimal 
path in Gn from s to u. Therefore in our numerical analysis we have pointed out our 
interest on the suboptimal paths a little longer than the minimal one: can we interpret 
the suboptimal paths as the pure states (in a suitable sense) of our model? Let us stress 
that to leading order for A^ ^ 1 the lengths (/)*^^^ of the suboptimal paths coincide, as 
it results from (12) and is confirmed by numerical analysis (see next section). Otherwise 
the identification between subminimal paths and pure states will fail: in the SK model the 
free energies are equal in the ther mo dynamical limit. 



5. Numerical results 

As pointed out above, we have performed some numerical computations to test the validity 
of the scaling laws (8) , and to measure the probability distribution for the lengths of the 
subminimal paths. The algorithm used is a clever generalization of the Dijkstra's algorithm 
[15]. It determines, for a given realization of the distance matrix d{u, v), the first k paths of 
Cs,t ordered by increasing length. We have considered only /c = 3, because the CPU-time 
spent in the computation grows rather fast with k. We made several runs for a = and 
a = 1, i.e. with the probability distribution in (3) respectively uniform and linear, and 
for different values of A^, the size of the graph, ranging from A^ == 10 to A^ = 500. In 
each run we measure the length and the number of arcs of the paths for a large number of 
independent realizations of the distance matrix. More precisely we made 200.000 iterations 
for each value of a for A^ = 10, 20, 50, 100; 100.000 iterations for A^ = 200; 40.000 iterations 
for A^ = 500. The runs required about 150 hours of CPU on a DEC VAX 6000-520, running 
VAX Pascal. The runs to measure the probability distribution, being purely qualitative 
and not requiring a high precision, were made only for A^ = 100 and low statistics: 30.000 
iterations for each value of a. 

In figure 1 and 2 we plot respectively (n)*-^-* and {lY^'/{n)^^ versus the size of the 
graph A^, for h = 1,2,3 and both probability distributions for the distances (a = and 
a = 1). The scaling laws that qualitatively appear in these plots are the ones anticipated 
in equation (8'), i.e. for A^ ^ 1 



(n)(^) ^c-lnA^ + C^'^) 

(/)(^) K 

, , ,, , ~ — — independent on h 

(n)(^) A^'^ 
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We want to stress once more that to leading order in A^ these scahng laws do not depend 
on h. 

To be sure that the leading behaviour for the "physical" quantities (n) and (/) is the 
non- frustrated one of equation (8) and (12), we should be more quantitative, and show 
that 

c^s^^ (21) 

a + 1 

We cannot simply perform a naive least-squares fit, as our data could not have reached 
their asymptotic behaviour, because of the low involved values of A^. Therefore our data 
have been fitted with curves of the form 

InA^ 
(n) =c-lnN + C + B—— 

(n) N'^ InN^ 

for various Gxed values of B, applying the Hatness criterion advocated in [16, sections 4.2 
and 5.3]. The S-dependent term should take into account the corrections to the leading 
behaviour, and its form was suggested by equation (12). Although we believe that this 
technique has been now generally accepted, in the appendix we shall spend some words 
of explanation. The result of the analysis for (n) and {l)/{n) is given in table 1 and 2. 
From these data we see that relation (21) is verified well enough. Sufficiently well verified 
is also the fact that K is independent on /i, at least for a = 1. As to the value of C^'^' the 
agreement with equation (12) is only qualitative. 

Nevertheless the fitted values for c and s in table 1 and 2 are different from what 
is predicted by equation (21) by more than their error, though this should be a 95% 
confidence limit. Thus we conclude that, as we anticipated at the end of the previous 
section, the use of the non-frustrated leading behaviour (12) to fit our experimental data 
is not completely satisfactory. We strongly believe that the finite size errors are not 
underestimated, because the same procedure of fitting performed on the theoretical data 
leads to confirm the validity of (21) exactly, i.e. at 95% confidence level. More precisely 
we have proceeded as follows. We have generated a set of values for (/) integrating the 
theoretical distributions of equation (13) and (16), for the same values of A^ of the real 
data, and with fictitious statistical errors equal to the real ones. Then we have performed in 
parallel the same fit, with the same procedure of the previous analysis, on the experimental 
data and the theoretical ones, which so work as a placebo. The result is given in table 3: 
the data have been fitted with the functional form 

where again the corrections are suggested by equation (12). We see that the theoretical 
data give the expected values well within their error, with some troubles for h = 1, not 
only for the exponent s {i.e. l/(a + 1)), but also for the constant K' {i.e. 1 for a = 
and a/2/2 for a = 1, see (12)). As to the experimental data, though the fiatness region 

13 



in B is larger, thus producing a systematic error larger, the fitted value for s differs from 
l/(a + 1) by more than its error, and the discrepancy becomes stronger when h increases. 

In figure 3, 4 and 5 we show the various probability distributions for the length of 
the subminimal paths for A^ = 100 and both a = and a = 1. In figure 3 we plotted the 
distributions P^^'{1) of equation (13) for /i = 1, 2 and 3. Figure 4 presents the experimental 
distributions for /i = 1, 2 and 3, obtained by smoothing the histogram of the experimental 
data. More precisely we filled a histogram with the experimental data for /, for different 
realizations of the distance matrix (200 channels for 30.000 data, no one of the data falls 
out of the considered range) , then we averaged the counting of each channel with the H 
preceeding and the H following it (H = 5 for a = and H = 2 for a = 1), thus obtaining 
a smoother plot. Lastly, as far as the length of the shortest path is concerned, in figure 5 
we compare the distribution P^^\l) with /i = 1 of figure 3, the experimental distribution 
of figure 4, and the distribution of the "cavity method", i.e. the one obtained by solving 
equation (16), exactly for a = (see (19)), and numerically for a = 1. 

As a comment to figure 5 we remark again (this fact has been yet noted in the previous 
section) that the two theoretical distributions and the experimental one differ qualitatively 
from each others, at A^ fixed, though the first two distributions share the same leading 
behaviour versus N as to their mean value, whereas the third one gives a significatively 
different behaviour. 



Appendix 

In this appendix we sketch briefly the contents of the analysis of the corrections to the 
scaling a la Berretti-Sokal [16]. In our case we measure (/) and (n) for some values of 
A^, the size of the graph, and we would extrapolate from the data the scaling behaviour 
versus N. Because of the involved relatively small values of A^, we can not be sure that 
the data have reached their asymptotic behaviour, so we must take into account possible 
corrections to the scaling. Firstly we make an Ansatz for the corrections to the leading 
behaviour, as in (22). The coeflicient B of the corrections in (22) is not subject to a flt, 
although this would be easily made by well known non-linear methods of fltting, because 
this would not solve our problem, i.e. to understand to what extent the fltted parameters 
are reliable as true asymptotic values, and not merely as effective values, changing as the 
range in A^ increases. In fact we do not know exactly which the sub leading corrections are, 
and also, if we knew them, we could not include in a flt other but the flrst relevant ones, 
as our data are only for a few values of A^ (6, in our case). 

Thus we flx the value of B, we flx also a value Ncut for A^, and we flt the data with 
the curves in (22) only for A^ > Ncut- Varying Ncut at B flxed, if we observe a systematic 
dependence on Ncut of the fltted parameters (e.g. a systematic decrease of the fltted 
exponent s) , we say that the chosen value of B can not take into account the corrections 
to the scaling. Now, by varying B, we select the range in B for which the fltted parameters 
are flat (i.e. do not vary in a systematic way), with respect to Ncut- We say, following [16], 
that for B in this range equation (22) takes into account the corrections to the scaling as 
an effective correction, even if the exact form for the corrections is different from that we 
imposed, and even if we rule out some other (more irrelevant) terms. In this range of B, 
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which we caU the "flatness region" , for each fitted parameter we select the maximum and 
the minimum value. The best fit for the parameter will be simply the arithmetic mean 
of these values, whereas their difference will be considered as the systematic error due 
to unconsidered corrections to the scaling, or to imperfect knowledge of the form of the 
corrections (95% subjective confidence limit as defined in [16, footnote 17]). In addition 
we quote the usual statistical error for the fit, at 95% confidence level {2a). 
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Captions for the figures and tables 

Figure 1. (n)^ "> versus N, with uniform (a) and linear (6) probability distribution for 
the distances. 

Figure 2. The same as in figure 1 for (l)^^^ / {n)^'^K 

Figure 3. Distributions P^^\l) for the length of the h-th path as in equation (13), for 
A^ = 100 and uniform (a) and linear (6) probability distribution for the distances. 

Figure 4. The same as in figure 3 for the measured distributions Pexpil), obtained by 
smoothing the histogram of the experimental data. 

Figure 5. Three probability distributions for the length of the shortest path, with A^ = 
100 and uniform (a) and linear (6) probability distribution for the distances. P^^\l) is 
the distribution quoted in (13), as in figure 3, for h = 1; exp denotes the experimental 
distribution, as in figure 4, for h = 1; cav denotes the distribution of the "cavity method" 
(see equation (16)). 

Table 1. Best estimates for c and C in the fit {n)^^> = c ■ In A^ + C, with corrections to 
the leading behaviour inserted (see (22)). The first error is the systematic error due to 
unconsidered corrections to the scaling (95% subjective confidence limit as defined in [16, 
footnote 17]) and the second error is the usual statistical error (95% confidence interval). 
It is also quoted the range in B which obeys to the fiatness criterion. 

Table 2. Best estimates as in table 1 for s and K in the fit {l)^^^ / {n)^^'^ = K/N-% with 
corrections to the leading behaviour inserted (see (22)). 

Table 3. Best estimates as in table 1 for s and K' in the fit (/)('') = K' ■ In N/N"", 
with corrections to the leading behaviour inserted (see (22')). The fit has been performed 
for both the experimental data (denoted by exp) and the theoretical ones. These last 
were produced by integrating the two theoretical probability distributions for /, the one of 
equation (13) and the one of the cavity equation (16). 
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Table 1. 





c 


C 


B 




a = 










h = l 


1.0070 ±0.0079 ±0.0023 


-0.475 ± 0.041 ± 0.008 


0.87^ 


0.99 


h = 2 


1.0363 ±0.0098 ±0.0024 


0.316 ±0.051 ±0.009 


-0.20^ 


-0.05 


h = 3 


1.0627 ±0.0179 ±0.0125 


0.650 ±0.095 ±0.060 


-0.60 ^ 


-0.40 


a = 1 
h = l 


0.5051 ±0.0108 ±0.0025 


0.283 ±0.061 ±0.010 


-0.55^ 


-0.30 


h = 2 


0.5051 ±0.0087 ±0.0074 


0.781 ±0.051 ±0.036 


-0.50^ 


-0.30 


h = 3 


0.5138 ±0.0129 ±0.0026 


0.971 ±0.073 ±0.011 


-0.30^ 


-0.00 



Table 2. 





s 


K 


B 


a = 








h = 1 


1.0191 ±0.0118 ±0.0021 


1.246 ±0.131 ±0.012 


0.25^0.50 


h = 2 


0.9914 ±0.0106 ±0.0016 


0.844 ± 0.094 ± 0.006 


1.70^2.10 


h = 3 


0.9780 ± 0.0070 ± 0.0032 


0.675 ±0.049 ±0.010 


2.90^3.20 


a = 1 
h = l 


0.4981 ±0.0102 ±0.0018 


1.361 ±0.119 ±0.011 


-0.08^0.10 


h = 2 


0.4920 ± 0.0076 ± 0.0013 


1.265 ±0.084 ±0.008 


0.20^0.35 


h = 3 


0.4948 ± 0.0076 ± 0.0012 


1.290 ±0.086 ±0.007 


0.20^0.35 



Table 3. 





s 


K' 


E 




a = 










exp h = 1 


1.0077 ±0.0077 ±0.0013 


1.098 ±0.074 ±0.007 


0.20 4- 


0.35 


exp h = 2 


0.9874 ± 0.0081 ± 0.0008 


0.814 ±0.075 ±0.003 


2.50 4- 


2.90 


exp h = 3 


0.9667 ±0.0044 ±0.0005 


0.571 ±0.034 ±0.001 


5.50 4- 


5.90 


eq. (16) 


1 


1 







eq. (13) h = 1 


0.9935 ± 0.0038 ± 0.0007 


0.932 ±0.030 ±0.003 


-0.36 4- 


-0.43 


eq. (13) h = 2 


0.9998 ±0.0011 ±0.0003 


0.997 ±0.009 ±0.001 


0.415 4- 


0.445 


eq. (13) h = 3 


0.9999 ±0.0013 ±0.0002 


0.999 ±0.012 ±0.001 


0.910 4- 


0.935 


a = l 
exp h = 1 


0.5063 ±0.0041 ±0.0007 


0.763 ±0.027 ±0.002 


0.16 4- 


0.24 


exp h = 2 


0.4990 ± 0.0035 ± 0.0004 


0.692 ±0.025 ±0.001 


1.514- 


1.63 


exp /i = 3 


0.4931 ± 0.0033 ± 0.0004 


0.638 ±0.023 ±0.001 


2.40 4- 


2.55 


eg. (16) 


0.4994 ± 0.0025 ± 0.0006 


0.702 ±0.014 ±0.002 


0.29 4- 


0.33 


eg. (13) /i = 1 


0.4951 ± 0.0025 ± 0.0006 


0.670 ±0.013 ±0.002 


0.25 4- 


0.29 


eq. (13) /i = 2 


0.4999 ± 0.0016 ± 0.0002 


0.706 ±0.010 ±0.001 


1.10 4- 


1.14 


eg. (13) h = 3 


0.5000 ±0.0010 ±0.0001 


0.707 ±0.007 ±0.001 


1.60 4- 


1.63 



